Picture for Zhipeng Wei

Zhipeng Wei

ImageAttributionBench: How Far Are We from Generalizable Attribution?

Add code
May 13, 2026
Viaarxiv icon

Think, Then Verify: A Hypothesis-Verification Multi-Agent Framework for Long Video Understanding

Add code
Mar 05, 2026
Viaarxiv icon

Is Reasoning Capability Enough for Safety in Long-Context Language Models?

Add code
Feb 09, 2026
Viaarxiv icon

EchoingPixels: Cross-Modal Adaptive Token Reduction for Efficient Audio-Visual LLMs

Add code
Dec 11, 2025
Figure 1 for EchoingPixels: Cross-Modal Adaptive Token Reduction for Efficient Audio-Visual LLMs
Figure 2 for EchoingPixels: Cross-Modal Adaptive Token Reduction for Efficient Audio-Visual LLMs
Figure 3 for EchoingPixels: Cross-Modal Adaptive Token Reduction for Efficient Audio-Visual LLMs
Figure 4 for EchoingPixels: Cross-Modal Adaptive Token Reduction for Efficient Audio-Visual LLMs
Viaarxiv icon

OneLoc: Geo-Aware Generative Recommender Systems for Local Life Service

Add code
Aug 20, 2025
Figure 1 for OneLoc: Geo-Aware Generative Recommender Systems for Local Life Service
Figure 2 for OneLoc: Geo-Aware Generative Recommender Systems for Local Life Service
Figure 3 for OneLoc: Geo-Aware Generative Recommender Systems for Local Life Service
Figure 4 for OneLoc: Geo-Aware Generative Recommender Systems for Local Life Service
Viaarxiv icon

Utilizing Jailbreak Probability to Attack and Safeguard Multimodal LLMs

Add code
Mar 10, 2025
Viaarxiv icon

DuMo: Dual Encoder Modulation Network for Precise Concept Erasure

Add code
Jan 02, 2025
Figure 1 for DuMo: Dual Encoder Modulation Network for Precise Concept Erasure
Figure 2 for DuMo: Dual Encoder Modulation Network for Precise Concept Erasure
Figure 3 for DuMo: Dual Encoder Modulation Network for Precise Concept Erasure
Figure 4 for DuMo: Dual Encoder Modulation Network for Precise Concept Erasure
Viaarxiv icon

Emoji Attack: A Method for Misleading Judge LLMs in Safety Risk Detection

Add code
Nov 01, 2024
Figure 1 for Emoji Attack: A Method for Misleading Judge LLMs in Safety Risk Detection
Figure 2 for Emoji Attack: A Method for Misleading Judge LLMs in Safety Risk Detection
Figure 3 for Emoji Attack: A Method for Misleading Judge LLMs in Safety Risk Detection
Figure 4 for Emoji Attack: A Method for Misleading Judge LLMs in Safety Risk Detection
Viaarxiv icon

ReToMe-VA: Recursive Token Merging for Video Diffusion-based Unrestricted Adversarial Attack

Add code
Aug 10, 2024
Figure 1 for ReToMe-VA: Recursive Token Merging for Video Diffusion-based Unrestricted Adversarial Attack
Figure 2 for ReToMe-VA: Recursive Token Merging for Video Diffusion-based Unrestricted Adversarial Attack
Figure 3 for ReToMe-VA: Recursive Token Merging for Video Diffusion-based Unrestricted Adversarial Attack
Figure 4 for ReToMe-VA: Recursive Token Merging for Video Diffusion-based Unrestricted Adversarial Attack
Viaarxiv icon

Reliable and Efficient Concept Erasure of Text-to-Image Diffusion Models

Add code
Jul 17, 2024
Figure 1 for Reliable and Efficient Concept Erasure of Text-to-Image Diffusion Models
Figure 2 for Reliable and Efficient Concept Erasure of Text-to-Image Diffusion Models
Figure 3 for Reliable and Efficient Concept Erasure of Text-to-Image Diffusion Models
Figure 4 for Reliable and Efficient Concept Erasure of Text-to-Image Diffusion Models
Viaarxiv icon